Robust sub-Gaussian estimation of a mean vector in nearly linear time

نویسندگان

چکیده

We construct an algorithm for estimating the mean of a heavy-tailed random variable when given adversarial corrupted sample N independent observations. The only assumption we make on distribution noncorrupted (or informative) data is existence covariance matrix Σ, unknown to statistician. Our outputs μˆ, which robust presence |O| outliers and satisfies (1)‖μˆ−μ‖2≲ Tr(Σ) N+ ‖Σ‖opK with probability at least 1−exp(−c0K)−exp(−c1u), runtime O˜(Nd+uKd) where K∈{600|O|,…,N} u∈N∗ are two parameters algorithm. fully data-dependent does not use (1) in its construction, combines recently developed tools median-of-means estimators covering semidefinite programming. also show that this can automatically adapt number (adaptive choice K) it same bound expectation.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sub-Gaussian mean estimators

We discuss the possibilities and limitations of estimating the mean of a real-valued random variable from independent and identically distributed observations from a nonasymptotic point of view. In particular, we define estimators with a sub-Gaussian behavior even for certain heavy-tailed distributions. We also prove various impossibility results for mean estimators.

متن کامل

Sample-Optimal Density Estimation in Nearly-Linear Time

We design a new, fast algorithm for agnostically learning univariate probability distributions whose densities are well approximated by piecewise polynomial functions. Let f be the density function of an arbitrary univariate distribution, and suppose that f is OPT close in L1-distance to an unknown piecewise polynomial function with t interval pieces and degree d. Our algorithm draws n = O(t(d+...

متن کامل

A Robust Adaptive Observer-Based Time Varying Fault Estimation

This paper presents a new observer design methodology for a time varying actuator fault estimation. A new linear matrix inequality (LMI) design algorithm is developed to tackle the limitations (e.g. equality constraint and robustness problems) of the well known so called fast adaptive fault estimation observer (FAFE). The FAFE is capable of estimating a wide range of time-varying actuator fault...

متن کامل

F eb 2 01 7 Sub - Gaussian estimators of the mean of a random vector ∗

We study the problem of estimating the mean of a random vector X given a sample of N independent, identically distributed points. We introduce a new estimator that achieves a purely sub-Gaussian performance under the only condition that the second moment of X exists. The estimator is based on a novel concept of a multivariate median. 2010 Mathematics Subject Classification: 62J02, 62G08, 60G25.

متن کامل

Robust Estimation in Linear Regression with Molticollinearity and Sparse Models

‎One of the factors affecting the statistical analysis of the data is the presence of outliers‎. ‎The methods which are not affected by the outliers are called robust methods‎. ‎Robust regression methods are robust estimation methods of regression model parameters in the presence of outliers‎. ‎Besides outliers‎, ‎the linear dependency of regressor variables‎, ‎which is called multicollinearity...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Annals of Statistics

سال: 2022

ISSN: ['0090-5364', '2168-8966']

DOI: https://doi.org/10.1214/21-aos2118